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2 ■ Abstract 
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0^ ■ It is likely that the observed distribution of the microwave background teni- 

O , perature over the sky is only one realization of the underlying random process 

associated with cosmological perturbations of quantum-mechanical origin. If 
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wn| so, one needs to derive the parameters of the random process, as accurately as 

possible, from the data of a single map. These parameters are of the utmost 
importance, since our knowledge of them would help us to reconstruct the 
dynamical evolution of the very early Universe. It appears that the lack of 

5^ \ ergodicity of a random process on a 2-sphere does not allow us to do this with 

arbitrarily high accuracy. We are left with the problem of finding the best 
unbiased estimators of the participating parameters. A detailed solution to 
this problem is presented in this article. The theoretical error bars for the 
best unbiased estimates are derived and discussed. 
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I. INTRODUCTION 

The existing and planned measurements of the cosmic microwave background (CMB) 
anisotropies belong to the category of astronomical observations which promise a direct link 
to fundamental physics, and therefore they attract additional attention. 

The mere detection of the quadrupole anisotropy at the level 5T /T = 5 ■ 10~^ |l[] allows 
us to conclude that the Universe remains to be homogeneous and isotropic (all dimensionless 
deviations are smaller than 1) at scales much larger than the present-day Hubble radius Ih 
and up to distances about 500 times longer than l^ 0]. The significance of this result lies in 
the fact that such large scales are not directly observable now and will be accessible only to 
astronomers of a very remote future. At still longer scales, the homogeneity and isotropy of 
the Universe cannot be guaranteed, in the sense that some deviations can be larger than 1 
without conflicting the CMB observations p[. The transition from spatially flat cosmological 
models to open models does not affect these conclusions considerably |^. 

The mere existence of the long-wavelength cosmological perturbations, responsible for 
the observed large-angular-scale anisotropy, requires them to have special phases and to exist 
at the previous radiation-dominated stage in the form of standing, rather than travelling, 
waves [0. This conclusion follows from the Einstein equations, if we trust them to propagate 
the observed perturbations back in time up to, at least, the era of primordial nucleosynthesis 
without destroying the homogeneity and isotropy of that era. The distribution of phases can 
be only very narrow (highly squeezed) with two peaks separated by vr (see [^ and references 
there) - a sort of the "phase bifurcation" [Q. 

The already identified properties of the presently existing long-wavelength cosmological 
perturbations raise sharply the issue of their origin. Although several schemes are logically 
possible, it was argued 0] that the quantum-mechanical generation of cosmological pertur- 
bations was likely to be the one to do the job. If so, not only the existing requirements are 
satisfied but also some new specific consequences follow. 

In a broad sense, the quantum-mechanical generation of cosmological perturbations 
means the Schrodinger evolution of the intial vacuum state (no "particles" -perturbations) 
into the present-day multi-particle state (many "particles" -perturbations). Even the sim- 
plest, linear and quadratic, interaction Hamiltonians are capable of producing a variety 
of multi-particle states: coherent states (result of the action of a force, linear Hamiltoni- 
ans), squeezed vacuum states (result of the parametric influence, quadratic Hamiltonians), 
squeezed coherent states (combination of the two above). All these states have Gaussian 
wave-functions and are in this sense Gaussian. A single word "Gaussian" is too general to 
distinguish between these states. The difference between them lies in the statements regard- 
ing the mean values (zero - for squeezed vacuum states, nonzero - for the rest of the states) 
and the variances (equal - for coherent states, nonequal - for other states) of the conjugate 
variables characterizing the state, such as generalized coordinates and momenta, or quadra- 
ture components of the field, or, loosely, amplitude and phase, etc. In case of cosmological 
perturbations, there is no natural and unavoidable mechanism for generation of coherent 
states, but there is such one for generation of squeezed vacuum states: parametric (supera- 
diabatic) interaction of the quantized perturbations with strong variable gravitational field 
of the very early Universe (see IQ and references therein). The fact that the cosmological 
perturbations are being generated specifically in the squeezed vacuum quantum states (and 



not, say, in the "most classical" coherent states) dictates a number of properties of the per- 
turbations themselves and the CMB anisotropics caused by them 0. Squeezing is a physical 
phenomenon, not a formalism or a language. 

Let us imagine that the accurately measured distribution of the CMB temperature over 
the sky is decomposed over spherical harmonics (for precise definitions see Section 2): 

ijrp OO +1 

1=0 ■m=-l 

A priori, there may be nothing inherently random or quantum-mechanical behind this ob- 
served distribution of the cosmic temperature, in the same sense in which there is nothing 
inherently random or quantum-mechanical behind, say, the observed distribution of the 
stars of our Galaxy over the sky. If the measured CMB temperature map is a reflection of 
a given classical distribution of matter and gravitational fields, the derived fixed numbers 
cim is about all what we can extract from the map. The notion of the random temperature 
arises if the cosmological perturbations responsible for the anisotropics were randomly gen- 
erated. We should then interpret a particular perturbation field and the observed numbers 
cim caused by this field as one specific realization of the random process. The objective then 
is to find out and characterize the underlying random process, as accurately as possible, 
using the data of a single observed map. 

We maintain the view that the cosmological perturbations responsible for the observed 
anisotropics were generated quantum-mechanically. If they could be generated in coherent 
states, the mean quantum-mechanical values of the perturbation field would be nonzero, 
and they would be surrounded by small (at the level of the zero-point quantum oscillations) 
Gaussian fluctuations. Correspondingly, the theoretical distributions for the cim coefficients 
would have had nonzero means with small Gaussian fluctuations around them. Roughly 
speaking, in this case, the mean numerical values of 6T/T would be determined by the 
mean values of the perturbation field. In contrast, in case of the squeezed vacuum states, 
the mean values of the perturbation field and the mean values of the cim coefficients are zero, 
but the Gaussian fiuctuations around them are large. Roughly speaking, in this case, the 
numerical values of 5T/T are determined by the dispersion (square root of variance) of the 
perturbation field. In both cases, the words about Gaussian distributions should be taken 
with a great care. Whatever we are able to calculate presently, relies on the assumption 
that cosmological perturbations are weak and that the absolute value of 6T/T is a small, 
less than 1, number. On the other hand, if the variable 6T/T obeys a Gaussian (normal) 
distribution law, this quantity may take, even though with a small probability density, 
arbitrarily large values, which is in confiict with our initial assumption. In addition, the 
short wavelength perturbations became nonlinear in the course of evolution, and many extra 
physical processes were involved in the producing of the small-angular-scale anisotropics. 
However, in this paper, we will ignore these difficulties and will work with exact normal 
zero-mean distributions (Sections 2, 3). 

In case of squeezed vacuum quantum states, and the corresponding normal zero-mean 
distributions for the cim coefficients, the underlying random process is completely character- 
ized by the set of variances af (/ = 0, 1, 2, 3, ...) of the q^ distributions. These quantities are 
calculable if the cosmological model is postulated and, vice versa, the cosmological model 
can be determined if these quantities are known from observations. Specifically, the quan- 



titles af are calculable if the time dependence of the cosmological scale factor describing 
the very early Universe is chosen, and assuming that the rest of the evolution is known. 
Moreover, for simple cosmological models, the quantities af are related to each other and 
are all expressible through a small number of parameters. This happens, for example, if one 
assumes that the scale factor of the very early Universe had obeyed one of the power-law 
time dependences. In these cases, the problem of extracting cosmological information from 
a given map simplifies and reduces to the problem of determining those few parameters and 
testing those models. However, the real situation with the data can be more complicated 
1^. The observer may not be willing to believe any of theoretical cosmological models, even 
if he/she accepts the view that the underlying distributions should be normal zero-mean 
distributions. A more ambitious task than testing each of many possible models is the 
derivation of the true behaviour of the very early Universe from the observations, that is, 
from the observed set of numbers af. This is the position that we adopt in this paper. 
We begin from observations rather than from theory. Concretely, we want to answer the 
following question: What can be stated about the set of independent af on the grounds of 
a single observed map ? 

We assume that the sky coverage is complete, the foreground sources and contaminat- 
ing signals are under full control, the instrumental errors are negligibly small, the angular 
resolution is arbitrarily high, and the map is constructed from the raw data in the most 
intelligent and effective way P,^]. At the first sight, under these conditions, an arbitrarily 
accurate determination of all the af should not present a big problem. Indeed, the angular 
correlation function K{5)^ constructed as a result of averaging over many maps ("many 
universes"), has the form 

^T ^T 1 °° 

If the function K{S) is known, each of the af can be easily obtained from it. One simply 
needs to integrate over 6 the product of K{6) with the respective Legendre polynomial 
Pi{cos S). It is true that we have access to only one realization of the random process, not to 
infinitely many implied in the construction of the K{6), but this is not an obstacle by itself. 
If the process is ergodic [lyjl^ (see Section 4), the correlation functions can be built from a 



single realization, and the parameters of the random process, such as af, can be determined 
with arbitrarily high accuracy. What is required for the ergodicity of a random process in 
time or in 3-space is the decay of correlations in the limit of very large temporal or spatial 
separations. Then, the ensemble averages can be replaced by integrals over time or 3-space, 
and the parameters derived from one realization are true parameters with probabilty 1. 
In theoretical cosmology, the long-distance behaviour of the perturbation field is partially 
in our hands. It can be assumed to be appropriate, so that the field in 3-space can be 
made ergodic: the access to one realization would be sufficient for the determination of 
all parameters of the perturbation field. The difficulty apparently comes about when the 
3-dimensional ergodic process is being reduced to a 2-dimensional random process on a 
sphere, what effectively takes place when cosmological perturbations produce anisotropies 
in the temperature distribution over the sky. It appears that a random process on a 2-sphere 
can never be ergodic (in the sense of replacement of the ensemble averages by the integrations 
over the sphere) and the af can never be determined with arbitrarily high accuracy from a 



single map (Section 4). Possibly, this is a statement well known to mathematicians but we 
could not find an adequate reference. 

Without being able to find the true values of af we are left with the problem of esti- 
mating these parameters as accurately as possible. One can imagine that one is given a 
very precise map of the CMB sky, but the question is what to do with this map. This is 
a problem from the quite well developed theory of estimation and statistical inference (see, 
for example [0,0]). To evaluate a parameter we need to build an estimator - a random 



variable constructed from the original random process. The estimator is "unbiased" if its 
expectation value is equal to the true value of the parameter. And the estimator is the 
"best" if its variance is minimal among all possible estimators. In Section 5, we find the 
best unbiased estimator for af, and in Section 6 we find the best unbiased estimator for 
K{5). 

A concrete numerical value of the best unbiased estimator acquired at the observed map 
is the best unbiased estimate of the corresponding parameter. Usually, one is not satisfied 
with the best estimate alone, but wants to surround the estimate by appropriate error bars. 
This requires new definitions and criteria (see Section 7). Approximately, but not exactly, 
the size of the error box is characterized by the variance of the best unbiased estimator. 
Not surprisingly, since we have limited our discussion to normal zero-mean distributions, 
the most "natural" estimators turn out to be also the best unbiased estimators, and the 
maximum likelihood estimators, etc. In this way, this paper makes contact with previous 
work on the subject [|14 . 



Before concluding the Introduction we need to make two comments. 

First, our analysis is built on the assumptions that the observed map is only one realiza- 
tion of a random process, and that the underlying distributions are normal zero-mean distri- 
butions. These assumptions are in fact consequences of the parametric quantum-mechanical 
generating mechanism of the perturbations, but they are also testable hypotheses on their 
own. Logically, one needs first to show that one is dealing with a random process before try- 
ing to find out its characteristics. If we were experimenting with a noisy voltage generator, 
we could compare several sufficiently long records in order to argue that we were dealing 
with the different realizations of one and the same random process. In cosmology, we have 
control over only one record, only one CMB map. The assumptions made above can hardly 
be proven rigorously, but one can possibly find evidence in their support. Alternatively, 
they can be disproved at some level of confidence. One possibility to test these assumptions 
was indicated in Ref. 0. The product of two random variables 6T/T{ei) and 6T/T{e2) is 
a new random variable v. The probability density function (p.d.f.) for v was derived, and 
its functional form was shown to be quite special [6]. The mean value of v is K{6) but the 
variable v is not supposed to be a good estimator of K{S) since the variance of v is very big. 
However, this big variance should be present in the original observational data (before any 
angular integrations over the map are performed) if our statistical assumptions are correct. 
It looks unlikely that the quite special functional form of the p.d.f. for v can be mimicked 
by the anisotropics caused by perturbations of any other origin. In view of the forthcoming 
massive " pixelization" of the CMB sky, it will probably be possible to test directly whether 
the map values of the v satisfy, at least approximately, the theoretically derived p.d.f. for 
V. In this paper, we effectively assume that this is the case. 

Second, in our analysis we do not need to specify the nature of the cosmological per- 



turbations responsible for the observed large-angular-scale anisotropy. They can be density 
perturbations, or rotational perturbations, or gravitational waves. However, in order to 
build a correct general picture, it is very important to know what kind of perturbations we 
are actually dealing with. Hopefully, this question can be answered in future observations, 
with the help, for example, of polarization measurements (for a recent paper on the subject, 
see [0). So far, one can only rely on the theory. The quantum- mechanical generating 
mechanism, originally developed for gravitational waves, can also be applied, under certain 
conditions, to density perturbations and rotational perturbations. The contribution of each 
type of perturbations to the 5T /T can be calculated. According to the calculations of Ref. 
I6| , if the observed large-angular-scale anisotropies are indeed caused by cosmological per- 



turbations of quantum-mechanical origin, gravitational waves are at least as important as 
density perturbations and provide a somewhat larger contribution than density perturba- 
tions. 



II. VARIOUS REPRESENTATIONS FOR 5T/T 

The quantity which appears naturally in the theory of the CMB anisotropies is a relative 
variation of the temperature seen in a given direction e on the sky: 6T/T{e}. This quantity 
is a function of the angular coordinates on the celestial sphere 

^(e)^^(^,¥.). (2.1) 

It is convenient to define three different representations for this function: 

-7f7iO,^) = Y: E [aimYUO,v) + alXiW,^)l (2-2) 

1=0 m=~l 

= f.Uo^^^^ + J:[h'i^Yao,^)+htj^ao.^)]), (2.3) 

1=0^ V^ m=l ^ 

00 +/ 

= E E cimYUe,^), (2.4) 

i=0 m,=-l 

where the orthonormal spherical harmonics, either complex Yj^, or real Y^q/v^, YJ^ and 

yj^, are described in the Appendix. In what follows, the index / runs from to oo, and the 

index m runs either between — / and +/ or between 1 and /, as will be explicitely specified. 

Although the transitions between Eqs. ( ^^ , (p^ ), and ( p. 41) are quite straightforward. 



each of the representations has its own advantages and we will use them below. Since 
every factor v2 can eventually prove to be very important, we need a rigorous, even if 
somewhat pedantic, description of these representations. The coefficients aim are complex 
and can be expressed in terms of their real and imaginary parts: aim = ^Li + ^'^/m- I^ 
this a-representation, the function 6T/T{e) is manifestely real, but the number of the a- 
coefficients is larger than necessary. The b-representation is a little cumbersome, but it 
has to be regarded as canonical in the sense that it contains only real and independent 
coefficients 6^ {A = c, s) and does not require any extra constraints. The properties of 
other representations will be derived from the properties of this one. Finally, the coefficients 



Clm of the c-representation are complex, and in order to have real ST/T{e), they must satisfy 
the relationship cf^ = Q^_m- They can be also written as q^ = c[^ + ^cj^- Then, the last 
relationship implies c[^ = c[_^ and cj^ = -4,-m- 

Let us now describe how these representations are related to each other. 

The relationship between the b-representation and the a-representation is expressed 
through the following equations 

6^0 = 2a[o, (2.5) 

6L = v^(aL + a[„ J m>l, (2.6) 

6L = -v^(«L - <- J "^>1- (2-7) 

The link between the c-representation and the a-representation is given by 

Qm — Ct/m + 0,1 -m- 

The link between the c-representation and the b-representation can be written as 

Qo = %> (2-8) 

Clm = -^(feL - ^btJ m>l, (2.9) 

cim = ^ibl^m + ^K-m) m<-l. (2.10) 

Using the previous equations it is easy to check that Q^_m = l/v^(^im + ^^fm) — c*im- We can 
also express the real and imaginary parts of the coefficients q^ in terms of the coefficients 
h{^. Explicitly we have 
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If m < —1, then bf^ and bf^ have to be replaced by bf_^ and bf_^, respectively. Finally, 
Eqs. ( p.8| )-( prT0D can be inverted, and one can express the coefficients b-j^ in terms of the 



coefficients cim- One obtains 

(2.11) 
m > 1, (2.12) 

J m>l. (2.13) 
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III. THE PROBABILITY DENSITY FUNCTIONS, MEAN VALUES, AND 

VARIANCES 

We need to formulate our statistical assumptions about the CMB temperature. We will 
start from the b-representation. Our assumptions are as follows: 1) all the coefficients 6^q, 
bfj^ and bf^ are statistically independent random variables, 2) each individual variable is 
normally distributed with a zero mean, 3) all variables with the same index / have the same 
standard deviation ai. In other words, the probability density functions for the b-coefficients 
are given by the expressions 
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/(6?o) = -j^e -r , (3.1) 



1 






f(btJ = ^^e -r m>l, (3.2) 



1 



(bf f 



f{bL) = ^^e -f m>l. (3.3) 



The 6T/T taken in a given direction is a random variable, while the 6T/T treated as a 
function of 6, if is a. random (stochastic) process. 

Having the p.d.f.'s one can calculate various useful expectation values. For the mean 
values of the 6-coefficients, one obtains 
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(%) = {biJ = iMJ = m > 1. 
For the quadratic combinations, the result is 

KoKo) = <Ki., (3.4) 

(Km^Km^) = KrmKm,) =0 7^, 7772 > 1. 

In a similar manner, one can also determine the quartic combinations. 

We can now deduce the statistical properties of the two other representations. For the 
a-representation, all the coefficients a[^ and a\^ can be taken as statistically independent, 
and their p.d.f.'s can be written as 

c^ 2{ar )^ „ 2(ai )2 

O ^ Iva' O ^ Im' 

To find the expectation values one can use these p.d.f.'s, and can also make a consistency 
check with the help of equations (|2.5| )-( p7D and (|3.1| )- (|3.3| ). The mean values of a\^ and a\^ 
are obviously zero (and therefore this is also the case for a^^ and af^): 

(«L) = («L) = 0. 

The quadratic combinations of a[^ and a\^ have the values 
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\'^hmi'^l2m2) ~ \'^limi^l2m2) ~ ^5 



from which one derives 
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\(^hm\(^l2m2l ~ U, \'^hm\'^l2m2l ~ '^(^hl2^mim2- 

Some of nonvanishing quartic combinations are given by the expressions 

2 2 2 2 
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2 2 

\(^hmiO'l2m2(^h^m3^l4m4) ~ 7 (,0/i/30mim30/2/40m2m,4 + 0/j^/40mim40/2/30m2m3 J) 

and other nonvanishing combinations can be obtained by permutting the indices r, i. 

Finally, starting from the postulated distributions in the b-representation, one can also 
establish the corresponding equations for the c- representation. All the coefficients c[q, c[^ 
and cj„ {m > 0) should be statistically independent, and their p.d.f.'s should be written as 

ficio) = -^^e ^ 



and 

1 ^ Ira' 1 ^ Im' 

f{cU = ^e ^r, f(c\J = ^e ^T, m>l. 

Note that the standard deviation for c[q is different from that for c[^ and c]^. Obviously, 
the mean values of the coefficients c[q, c[^ and cj^ vanish 

(4) = {ciJ = {c\J = 0. 

The mean value of the quadratic combination of c[q is given by 

For other coefficients (where mi and 1712 are not both equal to zero) one obtains 

2 2 

\^hm\^l2m2l ~ 7^"'i'2"m,im2 H 7~0/j^/20mi,-m2) 

2 2 

\^hm\^l2m2l ~ iy^hl2^rnim2 '^(^hl2^mi-m2) 

\^limi^l2m2/ \QimiQ2m-2' 



This leads to 

The last equations are also valid for mi = 1712 = 0. 

One nonvanishing quartic combination is given by the expression (others can be obtained 
by the complex conjugation): 

\QimiQ2ni2'-^'3'T),3Q4m4/ ^I^^l2^lil3^mi,—m3^l2l4^m2,—m4 ' ^I^^l2^lili^mi,—m4,^l2ls^m2,—m-} 

+ (^h^^ls^hh^rm-miShU^ms-mi- (3-6) 

Let us now introduce new random variables which will play an important role in what 
follows. Let us define random variables of, bf, and cf by the equations 

I 

2 -- ST^ * 

C'l — 2^ '^ImO'imi 

m=—l 

m=l 
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2 Y^ * 

C-l — 2^ ClmCim- 
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Using Eqs. ( p.ll| )-( p?T3D it is easy to show that hf = cf. 

m values of these new ran( 

2{af) = {bf) = {cf) = {21 + l)at 



One can compute the mean values of these new random variables 

2\ /r,2\ / 2\ /o; I i\_2 



and their variances 

(«f)-(«f)' = ^(«?)^ (3.7) 

{bt)-{bf)' = {ct)-{cf)' = ^^{cf)'. (3.8) 

It is important to note that the above relationships are trivial consequences of the postulated 
distributions ( |3.1| ) - ( |3.3| ). These relationships are always true, regardless of what and how 
is measured, and regardless of whether we have access to only one realization of the random 
process (only one sky or portion of sky) or to infinitely many. (But if one wants to use a 
"cosmic" word, one is free to call the relationships ( |3.7| ), ( |3.8| ) the "cosmic variance".) 

The original probability distributions dictate also the p.d.f.'s for these quadratic vari- 
ables. They are the so-called x^ - distributions. Denoting x^ = bf/crf and n = 21 + 1 one 
can write [IT! 



, (;^2)(n-2)/2^^xV2 

^^^ '""^ (n/2-l)!2«/2 ■ 
and for the random variable 6? we have 
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«"?> - („/2-l)!2~/. (3-8) 

So far, we have been concerned with the statistical properties of the coefficients in the 
expansion of the random process 6T/T over the orthonormal spherical harmonics. We can 
now discuss some properties of the random process itself. No doubt, these properties follow 
from the properties of the coefficients. Obviously, the process is isotropic in the sense that 
the mean value of the 6T/T is one and the same number (zero) in every direction on the sky 

(^(e)) = 0. (3.10) 

The process is also homogeneous in the sense that the angular correlation function depends 
only on the angle 6 between two directions, but not on directions themselves. For each pair 
of vectors ei and 62 separated by the angle 6, the angular correlation function takes the form 

{^{e,) — {e,)) = K{6) = - ^af (2/ + l)P,(cos<5), (3.11) 

where Pi{cosS) are the Legendre polynomials. 

The three-point correlation function [as well as all correlation functions containing an 
odd number of term 6T/T{ei)] vanishes: 

, ST , ^ , 5T , ^ , 5T , ^ , , 

( — (ei) — (el) — (e3)) = 0. 

The four-point correlation function is given by the expression 

(^(61)^(62)^(63)^(64)) = TT^E E ay„,{2l+mm + l) 

J^ -L J^ J^ y^T^j i=Qm=0 

Pl{cOs6i3)Pm{cOs624) + P/ (cOS (5i4)Pm(cOS ^23) + P/ (cOS 5i2)Pm(cOS (534^ 

where the symbol 6ij denotes the angle between vectors Cj and Cj. In a similar manner one 
can derive the higher-order correlation functions and express them through the lower-order 
ones. 

All the derived expressions are consequences of the postulated distribution functions 
( p.l|) - ( p.3|) . We do not possess a rigorous mathematical proof of the statement that there 
exists a one-to-one correspondence between Eqs. ( pTI|) - (|3l3| ) and the fact that the relevant 
cosmological perturbations are placed in the squeezed vacuum quantum states. However, 
we believe this statement is indeed true. At any rate, the quantum-mechanical expectation 
values coincide with the corresponding ensemble averages if the appropriate identifications 
are made [6]. In particular, the quantum-mechanical expression for the angular correlation 
function 

ST ST °° 

(0|^(ei) — (62)10)= E QP,{cosS) 

I—} 

coincides with Eq. (|3.11|) if we identify 
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1-{21 + l)af = Q. (3.12) 

The quantity Ci explicitely contains the square of the Planck length. This quantity is 
calculable when the law of cosmological evolution and the sort of cosmological perturbations 
are specified. The value of the lowest multipole Imin is also determined by the sort of 
perturbations. In this sense, the abstract quantities af, which completely characterize the 
random process (pTTp, ( p.lD - (|3.3| ), are also calculable and are given by the expressions 



IV. ERGODICITY OF RANDOM PROCESSES ON A LINE AND ON A SPHERE 



Let us first recall the ergodic theorem |T^,n] for a time-dependent random process x(^, t) 
defined on an infinite t-line. The symbol ^ indicates different possible realizations. Let us 
assume that the process is stationary, that is, its mean value does not depend on time 

(x(^,t)) = const = m, 

and its correlation function depends only on the time difference 

{x{U + r)x{U)) = B{T). 

To find the ensemble average of x(^, t) at a fixed moment of time, one takes a large number 
A^ of different realizations and calculates the arithmetic mean of the observed values: 

1 ^ 

In the limit of A^ going to infinity, the quantity / tends to the theoretical ensemble mean 
{x{^,t)) of the random process. 

Let us now consider a situation in which we have access to only one realization x(.^0;^) 
of the random process. What can we say about m and B{t) on the grounds of this single 
realization ? The ergodic theorem defines the conditions under which the ensemble averages 
can be replaced by the time averages, so that the m and B{t) can be found from the time 
integrations of x(^o, t)- 

Introduce a random variable xt(0 defined by the equation 

^T(0 = ^/_^J^(e,t)dt. (4.1) 

This variable is an unbiased estimator of m because {xt{C,)) = m. However, we can say 
much more when we take the limit T —>■ cxd. If the process is such that 



lini ^ rji?(r)dr = 0, (4.2) 
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then 

1 /■+'^ 
im - 

T 



1 f+'^ 

lim — - / x(f,t)dt = m, 



for every realization ^. When condition ([4.2| ) is satisfied, the process is called mean-ergodic. 
The condition ( ^.21 ) can also be expressed as the requirement a^^ ^ in the limit T — >■ cxd, 
where a^^ is the variance of the random variable defined by Eq. ( |4.1| ). This explains why 
one is capable of deriving from a single realization a true parameter of the ergodic random 
process (in this case, the mean value m) with probability 1. Indeed, for every arbitrarily 
small e, one has the Tchebysheff inequality 

2 

P{\xT-m\ <e) > 1-^, 

and the probability goes to 1 when a^^ goes to 0. A sufficient condition for the validity of Eq. 
( [4.2|) is the vanishing of the correlations at large temporal separations: liniT-^oo B{t) = 0. 

More stringent conditions should be satisfied for the process to be correlation-ergodic, 
that is to have 

hm ^ r^x(e,t + r)x(e,t)dt = i?(r), (4.3) 

for every ^. Here, for simplicity, we will restrict ourselves to normal zero-mean (m = 0) 
stationary processes. The equality ( |4.3| ) allows one to find the variance of the process from 
its single realization. This equality takes place if and only if the process is such that 

hm ±, r''\Bir)\'dT = 0. (4.4) 



For normal zero- mean stationary processes, and when the condition ( [4.4|) is met, all the 
higher-order correlation functions can also be replaced by the time averages. Note that the 
integrand in Eq. ( ^.4| ) is a strictly positive function. The limit of the expression ( |4.4| ) is zero 
because the denominator goes to infinity in the limit T -^ oo. 

We will now try to apply the notions formulated above to a random process on a sphere. 
The problem of our interest is the CMB temperature distributed over the sky. Strictly speak- 
ing, the quantum-mechanically generated cosmological perturbations form a non- stationary 
process: the squeezing makes the temporal correlation function a function of individual mo- 
ments of time, and not only of the time difference. This property may turn out to be very 
important for the future observations of short gravitational waves, but is irrelevant for our 
discussion of very long- wavelength cosmological perturbations responsible for the microwave 
background anisotropics, since the time scale of variations is much much longer than the 
interval of time between possible missions for the CMB observations. Most importantly, the 
stationarity - a necessary (but not sufficient) condition for a time dependent process to be 
ergodic - is replaced in our case by the analogous properties of isotropy and homogeneity of 
the process on a sphere, see Eqs. ( p.lO[ ), ( p.ll| ). So, at least the necessary conditions for our 



process to be mean-ergodic and correlation-ergodic are satisfied. 

To check the analog of the condition ( [4.2| ), we will use K{6) instead of B{t) and will 
replace the time integral divided by T by the integral over a sphere divided by An - the 
surface area of a (unit radius) sphere. The result is 
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4^i^"^''W = 4^-»- 



The right-hand-side of this equation is zero when ctq is zero, that is when the monopole 
coefficient b^Q in the expansion ( p.3|) is identically zero, see Eq. ( p.4|) . In this case, our 
process is indeed mean-ergodic since the integral over a given map 



, , dn—(e) = ^=U 



'00 



vanishes, and the average over the map coincides with the ensemble average (|3.10|) . 



What we really would like to have is the correlation ergodicity of our process. In this 
case, we would be able to replace the ensemble averaging by the integration over the sphere, 
and to find the K{5) and hence all the af from a single map. Unfortunately, this is exactly 
what we cannot have on a sphere. The necessary and sufficient condition ( |4.4| ) translates 
into the requirement 

i^idf!|A-WP = 0. (4.5) 

The left-hand-side of this equation can be calculated using Eq. (|3.11|) and the orthogonality 
properties of the Legendre polynomials. The resulting equation 

E(2^ + i)^f = 



167r2 ,^^ 

can only be true if all the af are zero. In contrast to processes on an infinite line or in infinite 
space, we do not have here an infinite volume factor in the denominator of the left-hand-side 
of Eq. (|4.5|) to enable it to vanish. We are bound to do the best what we can do with a 
single map - try and find out the best unbiased estimates for the parameters af and K{5). 

V. THE BEST UNBIASED ESTIMATOR FOR THE al 

Let us denote an estimator for the af by fi. This is a random variable constructed from 
the original random process. One realization of this process is the observed map. The most 
general quadratic expression for fi is given by 

h = I^J^^dfiidfi^^^Kei, 62)^(61)^(62), (5.1) 

where dfl = sin6'd^d(y5. The function w; (61,62) is a weight function to be determined from 
the requirements that the estimator fi is unbiased and the minimum- variance (the best). In 
this formulation, the problem was essentially solved in Ref. [O. We refine and expand the 



arguments of Ref. |jT7|. 



An arbitrary weight function iw; (6*1,62), being a function of two sets of angular coordi- 
nates, can be expanded over two sets of orthonormal spherical harmonics: 
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w^Kei, 62) = 2. 2^ 4ioo — iK -JK- 

i=Oi=0 v^ V^ 

+ zJzJM/ijon * 77:: ^/n(e2) + C?Kjon * /7T '^/n(e2)] 
ij n=l V^ V^ 

i,j m=l V^ V^ 

+ EE[4;^>nn>^4(ei)V};(e2) +rf-_y4(ei)>^;(e2) 



+ 4^>.n>^4(ei)l^;(e2) + d^[,mr^Z.ie^y;rrie2)]. (5.2) 

In this expression, all the coefficients d{l^^^ {A, B = c,s) are real. The weight function 
may be asymmetric with respect to the interchange of ei and 6*2, but the antisymmetric 
part of this function will not contribute to Eq. ( |5.1| ) anyway. To simplify calculations, we 
require this function to be exphcitely symmetric, w/(ei,e2) = w/(e2,ei). This means that 
the d-coefficients must obey the relationship 

MB ^ ^BA 

^lijmn Ijinm' 

We will mostly use another representation for wi defined by: 

OO OO j j 

wi{ei,e2) = J2J2 J2 E diijmnyim{ei)Y*^{e2). (5.3) 

i=0 j=0 m=—i n=—j 

In this case, the coefficients dujmn are complex, and since the function wi{ei, 62) is real, they 
have the property 

or, if we introduce the real and imaginary parts of dujmn, i-e. dujmn = d^^j^^^ + id\^j^^, 

rj-r = fjr d^ = —el''- (^ A) 

lijmn lij,—m,—n^ lijmn lij,—m,—n' V^ / 

In addition, the weight function is symmetric in this representation if 

(^lijmn (^lji,—n,—m- 

The link between the two representations is expressed by the equations 

U'lijOO — W/ijOO' 



U/7inn S _\_(JCC _ -JCS 



{dfijO,~n ^dfijQ ) n < 1, 



, _ J ^{dhjmO ^"/iimo) 171 > 1 



V2^ 

ifz/cc _|_ ^ss _ -Jsc _|_ -Jcs \ m > ^ r) > ^ 

2\"'lijmn ^ '-''lijmn '■'^Hjmn ^ '''^lijmn) in ^ l ib ^ l 

2\'^lijm,—n '^lijm,—n '''^lijm,—n '''^lijm,—nJ 
2\ lij,-mn "'lij,-mn ' ''"'lij,-mn ' ''"'lij,-mnJ 



, _ , 2\"'lij~m-n'"'lij,-m,-n''^"'lij-m,-n '^"'lij-m-n) '^ ^ l n< i 

Ulijmn \ l(Jcc — rlss —■jrl'^'^ — iri^^ ] 77? > 1 77 < —1 

I 2^ lijm-n '^lijm-n '''^lijm-n '''^lijm-nJ lib ^ L lb ^ i. 

>\^lij,-mn ~ d-lij-mn + *%7,-mn + '''^lij,-mn) Ul < —1 n>l. 
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The angular integrals in Eq. (|5.1|) can be performed explicitely. This integration returns 
us from the random process 5T /T to the random variables - coefficients in the decomposition 
of the 5T /T - and allows us to express the estimator /; in terms of the general quadratic 
combination of these coefficients. As a consequence, we arrive at the following expression 
for the fi in terms of the b-coefficients: 

CO oo ^ 

i 

+ Z^ {"■lijmO'^jO^im + ^lijmO'^jO^im) + Z^ [d-HjOn'^iO'^jn + '^/ijOn"iO"in) 
m=l n=l 

i J s 

_i_ \^ S^iH'^^ H^ }f ^ d"'^ W V ^ d'^" V W -I- (7'*'* V V '\\ {^^^ 

^^ / J / j \^liim,n"im"in '^ "'lijmn im^jn '^ lijmn^im jn '^ lijmn^im^jn) j' K^'^J 

m=l n=l 

In the representation defined by Eqs. ( |5.3D , (p.4|) , the last equation takes the form: 

oo oo i j 

fl=J2J2 J2 J2 dlijmnC*^Cjn. (5.6) 

1=0 j=0 m=—i n=—j 

We have introduced the general expression for the estimator and can now subject it to 
the desired requirements. Let us start from the mean value of the estimator. Using Eqs. 



]6D and ( |3.5| ) we obtain 



OO OO I 



1=0 j=0 m=—i n=—j 

OD i 

2_^ i / ' "'liimm- 
i=0 m=—i 

Since the second set of equations ( |5.4| ) guarantees 

i i 

Er/* = — S^ H^ = n 
liimm / j liimm ' 

m=—i m=—i 

the mean value of /; reduces to the following manifestly real expression: 

OO i 

U'l) = Xl'^i Yl duimm- 
j=0 m=—i 

We want our estimator to be unbiased, that is, we impose the condition (/;) = af. This 
requirement can only be achieved if 

i 

/ . d^^^ = On. (5.7) 

m=—i 

Equation ( |5.7| ) form the first set of constraints on the weight function wi and define the 
family of unbiased estimators. 
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The next step is to find, among the unbiased estimators, the one whose variance is 
minimal. The variance a? of the random variable /;, 

4 = (fi) - (/')'' 

can be calculated using the definition ( |5.6| ) and the equations (p.6|). The general expression 
for the variance reduces to the form 

OO OO j j 

^/, =2EEE 11 duj^ndlj^yia]. (5.8) 

i=0 j=0 m=—i n=—j 

We have to minimize this expression taking into account the constraints ( |5.7| ). The expres- 
sion ( ^.8|) is the sum of strictly positive terms. To minimize this sum, we should set to 
zero as many terms as possible. First, we need to set to zero all the d-coefficients which 
do not participate in the constraint (|5.71 ) and whose presence in the sum only increases the 
variance. Thus, we require to vanish all the coefficients d\^j^^ and those of d^ij^n which have 
indices i y^ j and/or m y^ n. To minimize the remaining variance under the constraint ( ^.7| ), 
we introduce the Lagrange multipliers Aj and write 

y oo i oo i s 

5(2Y: E id^UmJ'^t + EH E dlumm-5u)]=0, 
i=0 rn=—i i=0 m=—i 

Since the d\^^^^ are treated as independent variables, the variation of the previous expression 
provides us with 

^d\,,m^cjt + A, = 0. (5.9) 

The sum over m of these equations together with Eq. ( [5.7| ) determine the quantities Aj: 

Aa% + {2i + l)\i = Q. (5.10) 

Using the expression (|5.10|) for the Lagrange multipliers A, in Eq. (|5.9| ) allows us to write 



IT r 

^liimm O ' _i_ 1 

Thus, taking into account all relationships, we obtain the complete set of constraints on the 
weight function ( |5.3| ) which make the estimator fi unbiased and best: 

dlijmn ~ o- _|_ 1 ^li^ij^rnni d^^^,^ = 0. (5-llj 



In the representation ( |5.2| ), this amounts to 

1 



dl^JOO=^-^Sl^S^,. (5-12) 



1 
2i + l 

with all other coefficients being zero. 



dlijmn — dlijmn ~ o„- i 1 "li"ij"rnn, (5.13) 
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Having found all the d-coefficients, we can write the weight function wi{ei, 6*2) explicitely. 



Using the definition ( |5.3|) and the found expressions (|5.11| ) we obtain 



wi{ei,e2 



1=0 j=0 m=—i n=—j 



1 ' 



21 + 



- Y: Y,miei)Y;^ie2) (5.14) 



m=—l 



= ^Piicos5i2), (5.15) 

where, in the last step, the summation theorem for spherical harmonics has been used. 
Therefore, the best unbiased estimator for af can be written as: 

^' = ^/^J^,d^idn2PKcos5i2)^(ei)^(e2). (5.16) 

This formula answers the question what to do with a given map in order to get a concrete 
number - the best substitute for the true parameter af. The answer is to perform with the 
map the integrations prescribed by this formula. In fact, the integrations can be further 
simplified. 

The estimator ( p.l6| ) contains a double integral of the product of two functions ST/T and 
therefore can be called a quadratic estimator. However, this estimator can be presented as 
a product of two linear estimators, i.e. a product of two appropriate single integrals of the 
function 6T/T. Indeed, using the summation theorem, Eqs. (|5.14|) , ( |5.15|) , formula (|5.16|) 
can be written as 

1 ' /■ 8T r 8T 

f^ = ^[^ E J^,<^^iYimiei)jriei)J^^dn2Y;je2) — ie2). 

This formula shows that it is sufficient to perform one appropriately weighted integration 
over the sphere with further multiplications and summations. Moreover, the remaining 
integrals define the Qm coefficients. So, we obtain the following expression for the best 
unbiased estimator in terms of the original random coefficients: 

/. = ^ E_^cLq^ = ^. (5.17) 



Of course, this is the same expression which could be obtained by inserting Eq. ( |5.11|) into 
Eq. (^) or by inserting Eqs. ( |5:T2| ), ( |513D into Eq. d? 



We have found the estimator with the smallest possible variance among all unbiased 
estimators. It is useful to write this minimal variance explicitely. This can be found either 
from Eqs. ( |5.8| ) and ( |5.11|) or from Eqs. ( |5.17|) and ( p.8|) . The result is 



^2 _ -^r^ 
21 + 1 



Of. = — T(Ti . 



VI. THE BEST UNBIASED ESTIMATOR FOR THE K{6) 

The best unbiased estimator for af is also the best unbiased estimator for the multipole 
moments Ci of the correlation function K{S), see Eqs. ( |3.11| ), ( p.l2| ). Since the parameter 



K(S) is a combination of the parameters af, it is not surprising that the best unbiased 
estimator for K{6) turns out to be the same combination of the best unbiased estimators 
for af. It is interesting and instructive to follow this relationship in detail. 

Let us denote an estimator of the K{6) by f{S). This is a random variable constructed 
from the random process 6T/T. The most general (quadratic) expression for the f{6) can 
be written as 

1(5) = J^J^^dn,dnMeue2,6)^ie^)^-^ie2), (6.1) 

where the 5 is a fixed angle, whereas the angle between variable directions ei and 62 will 
be denoted 612. The arbitrary weight function w{ei,e2,S) can be expanded, without loss of 
generality, over the Legendre polynomials Pi{cos6): 

1 00 
w{ei,e2,S) = 2" Xl(2^ + l)^«(cos(5)wz(ei, 62). 
^^ 1=0 

The estimator f{6) takes the form 

1 °° r r ST ST 

/W = ^T.i^l + l)Pi{cos6)J^J^,^dn,dn2Wi{e,,e2) — {e,) — {e2) 

-| 00 
= — 5:(2/ + l)/,PKcos(5). 
^^ /=o 

Now, we want the f{6) to be unbiased and best estimator of K{S). The estimator f{S) 
is unbiased if (fi) = af, and it is the best if the variance of the fi is the smallest one. In 
other words, we return to the solved problem (Section 5) for the estimator //. Using the 
results of the previous Section, we can write for the best unbiased estimator of the K{6): 

f{S) = -T.Pii^os6)cf. (6.2) 

^^ 1=0 

The weight function w{ei,e2,S) is given, taking into account Eq. ( |5.15|) , by 



1 00 
w{e,,e2,6) = jj-^J2i'^l + l)Pi{cos6)Pi{cos6i2). (6.3) 

"^ 1=0 



(47r)2 



This last equation can also be written as 



-| 00 I 
w{ei,e2,S) = —Y, J2 Pi{cos6)Yi^{ei)Yi*^{e2) 

1=0 m=—l 



showing that the double integral in Eq. ( |6.1|) decays into the products and summations of 
the appropriately weighted single integrals. This form of the weight function permits an 
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immediate recovery of the already known result ( |6.2|) . The variance of the best estimator 
(PD is: 



4w = ^E(2^ + lK^f(cos5). 



^^ 1=0 



The derived formulas answer the question what to do with a given map in order to derive 
the best unbiased estimate for the correlation function K{6). The outlined prescription 
essentially goes through the derivation of the best unbiased estimate for af. However, the 
weight function (|6.3|) allows also a different procedure for the derivation of the estimator 



and the estimate: the direct integration of the map, but with the help of the (5-function. 

Let us denote cos 5 = x and cos(5i2 = xq. Let us present (define) the function 6{x — xo) 
as an expansion over the Legendre polynomials: 

oo 

S{x - xo) = ^aiPi{x). (6.4) 

1=0 

To find the coefficients ai, multiply both sides of Eq. ( |6.4| ) by Pmix) and integrate by x 
from —1 to 1. The result is 

ai = — t; — Pii^o) 



and 



°° 2/ -I- 1 
6{x-xo)=j:^Pii^)Pii^o)- 

1=0 ^ 



Thus, the weight function ( p.3|) can be written as 



w 



;ei,e2,(5) = — (5(cos(5-cos(5i2). (6.5) 



Let us show that the integration in Eq. ( |6.1|) with the weight function ( |6.5|) does indeed 
provide us with the same result (|6.2|). The product of two 5T/T is the random process 



V = ^(ei) ^-(62). We have access to one realization of this process. To integrate the v over 
all directions ei, 62 on the sky separated by a fixed angle 6 one can proceed as follows. At 
the first step, rotate the vector 62 around the fixed direction defined by ei and integrate 
the V over the circle traced by the vector 62 on the sphere. The result will depend only on 
{9i,ipi) - the coordinates of the vector ci. At the second step, integrate the result over all 
61 and ipi, letting the vector el to run over the whole sphere. The final result, taking into 
account also the factor I/Stt^ in ( |6.5| ), should be the random variable we are interested in. 
Let us do this computation in practice. 

Every function on a sphere can be expanded in the basis of spherical harmonics. In 
particular, the function Yim{0, ip) can be expanded in the basis {Yir{9', ^p'); I > 0, —I < r < 1} 
according to the formula: 
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The coefficients of this expansion are called the Wigner D-functions [|T8|. They depend on 
the Euler angles a, /?, 7 describing the rotation which transforms the direction {6, (f) into 
the direction {9', ip'). Since the rotation specified by the angles a, P = —9i, 7 = —ipi brings 
the vector pointing out to the north pole to the direction defined by (6*1, ipi), we can write: 

I 

Ylmiei) = E ^'-(0' -)^r„^(«, "^1, '^l)- (6-6) 

r=-l 

In the same manner, the 1^^(62) can be expressed as 

¥,,(62) = E YU6, x)D%ia, -e,, -cp,). (6.7) 

Indeed, when the vector ci points out to the north pole, the ^-coordinate of 6*2 is simply 6. 
Using the definition ( |2.4| ) and the formulae (|6.(j|), (|6.7D, we can present the random 
process v as 

'v = J2J2J2 cimCpqYiriO, -)Yps{6, x)Dl.^{a, -Oi, -(/?i)D^^(a, -Oi, -ipi). 

Im pq rs 

Let us now perform the two step integration procedure described above. The first step 
amounts to the integration of v over the angle x from to 27r. Using the explicit form of 
the spherical harmonics given in the Appendix, we get: 



r2iT 2/ + 1 

/ vdx = Y.Y.Y. ClmCpg{——Y^^5, 



2/ + l,i/2, ,2p+l,i 



/2 



-r,. Art 

Im pq rs ^" ^" 



(p-s) 



r2n 



Xp + s) 

Pps{cosS)DlJa, -9,, ~Vi)DPJa, -9,, -^,) ^" e^^^dx- 
In the last expression, the integral ^^^ e*'^^dx is simply 2t:5so- Using the relationship 



V 47r 
we obtain 



2/ + 1 

DL(«,/?,7) = llm(/?,7), 



r2-K 

I vdx = 27rEEc'"^'^P9^p('^°S'^)^'»(^i''^i)^P'?(^i'V^i)- 



As expected, the result depends only on {6ij(fi). In order to complete the procedure, we 
have to integrate this result over fii. We find 

//'27r r 

dill / vdx = 2nJ2J2'^iniCpqPp{cos6) / dniYirrX9i,(pi)Ypq{9i,ipi) 

-'^ Im pq •' 

00 

= 27rE^Kcos(5)cf. 

«=0 



Restoring the factor I/Stt from (|6.5|), we arrive at the best unbiased estimator (|6.2|). The 



same procedure performed over a given map provides us with the best unbiased estimate of 
the correlation function K{5). 

21 



VII. BEST ESTIMATES AND THE ERROR BARS 

We will denote the best unbiased estimate for the parameter af by {<jf)Bu- Along with 
this one, there exist other estimates of the same parameter, for example, the maximum- 
likelihood estimate (erf) ml- Not surprisingly, for the postulated distributions (|3.1| ), (p.2D, 
( ^.31 ) these estimates coincide. 

Apparently the most "naive" evaluation of the af, giving nevertheless the correct result, 
would be the following one. From a given map one derives the set of the observed coefficients 
6;™ (that is to say, the set {bf^, bf^, bf^}, m > 1). We know (postulate) that each of them 
is drawn from the normal zero-mean distribution 

if 

f{blrn) = ^^e~^. (7.1) 



Each of the observed bim coefficients can be used for the maximum-likelihood evaluation 
of the corresponding af. [We omit the label (map) when it is clear that we deal with the 
observed quantities.] One finds this estimate, denoted af,^-., by assuming that the p.d.f. 
( [7. 1|) reaches its maximum at the observed bim. The result is known: af(^ = bf^. Indeed, 

ln/ = -il.W)-|| + C. 
where C is a constant. Then, one gets 



din/ 1 62 



+ 



Im 



daf 2af 2ar 



and the condition 

din/ 

daf 



(7.2) 



leads to the stated result. Since for every / we have 2/ + 1 independent bim coefficients and, 
hence, 2/ + 1 independent evaluations, the estimate of the true af is given by 



1^2 1 V-.2 hf 



^ Z^ ^l(m) - 2/ + 1 ^ ^'"^ 



21 + 1^ '^"^ 2/ + lt;- "" 2/ + 1 



This number coincides with the {af)Bu determined by Eq. ( ^.17| ). 

A similar maximum-likelihood evaluation of af is based on the joint p.d.f. for all coef- 
ficients bim (with the same index /) which is simply the product of the individual p.d.f. 's 

Oil 



1 -^ 

fi^mMmMm) = (2t^^2\ {21+1)1 2^ '"' 

Imposing the condition ( [7.2|) , we arrive at the same result 

1)2 

{o-i )ml = 21 j^ I ^ *^^' )'^'^- 
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Finally, we can give the maximum-likelihood estimation based on the p.d.f. ( |3.9| ) for the 
quantities bf. Repeating the same steps, we come again to the result 

{crf)ML = ^. (7.3) 

n 

Thus, in the first approximation, we can write for the true value of af: af = {(jf)BU- 
In the next approximation, we want to make this statement more accurate by assigning the 
error bars. This is the matter of definitions, and there are many ways of doing this. We will 
use the distribution function ( |3.9| ). This function attains its maximum at the measured b^ 
and the estimated {<jf)ML, Eq. ( |7.3| ). The value of the / at the maximum is 

Jmax 



(n/2-l)!(2/n)'^/2' 

When the af, treated as a variable parameter, deviates from (erf) ml, the value of the / 
decreases as compared with the fmax- We establish the error bars for af by requiring that 
the value of / does not drop below some confidence level 

/ = kUa., (7.4) 

where A;, A; < 1, is a fixed number. Within the error box are included all af surrounding 
{<^f)ML and up to the boundaries {af)k determined by two solutions to the equation (|7.4D. 
Let us denote x = {af)ML/icrf)k- Equation ( [7.4| ) takes the form 

2 
\nx — X = — Infc — 1. (7.5) 

n 

Obviously, x = 1 ii k = 1. Let us now consider small deviations from this solution for A; < 1. 
We write x = 1 — y, where \y\ << 1. By expanding the Inx in terms of y and considering 
the first nonvanishing approximation to the equation ( |7.5| ), we find y"^ = — (4/n) In A;. That 
is, the two wanted solutions are 



n 



The condition of their applicability is — (4/ri)lnA; << 1. Thus, in this approximation, we 
can write the true af as 



2 ,^ , 2^/^\nk, 



The choice of k is in our hands. If the distribution f{z) were a normal zero- mean 
distribution, then a reasonable choice of k would be A; = e~^/^, because f{z = a) = e'^^"^ fmax- 
The ~x^ distribution (|3.9| ) is not a normal one, but approaches a normal distribution for large 
values of I. As a guidance, we will use k = e~^'^ in Eq. ( |7.6| ). Then we get 



af = (af)Rrr(l± 



'l )BU 



. 2/ 
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This formula becomes progressively inaccurate for small /. Specifically for / = 2 this 
formula would imply the error at the level ±0.6. However, a direct derivation of the error 
bars from equation ( |7.5| ) (and assuming k = e~^/^ ) gives 

where a lies between +1.0 and —0.4. Note the assymetry of the error interval: the larger 
than {(T2) Bu values are more tolerable than the smaller ones. 
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IX. APPENDIX 

The complex spherical harmonics Yim{0,<^) are defined by the expression: 

In this expression, / > and —l<m<l. The functions Yim{0,^) satisfy the relationship 
Y*^ = Yi^-rn- On the other hand, the real spherical harmonics are defined by the equations: 

/o/ 4-1(1- mVxVa 

/2l + 1 (/ - m)'\^/2 
YiU^.v) = y 27r il + m)\ ) Pirn{cos9)smm^, 

where / > but < m < I. The indices c, s indicate the presence of cos rrvp or sin rmp, 
respectively. The link between the complex and real spherical harmonics is given by 

Y,'r, = ^{Yi^ + Y;j m>0, 

Yi:^ = -^{Yi^-y;j m>0. 



The scalar product of two functions f{6, ip) and g{6, (f) on the sphere is defined by 

f2-K 

/o Jo 

Then, we have the following properties: 



if,g) = r d^ rd9sm9n9,^)gi9,v). 
JO Jo 
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(Xlm, yi'm') — Sll'Smm' — I < m < I, 

The functions ^ (/ > 0), 1^^ (/ > 1, / > m > 1), 1^^ (/ > 1, / > m > 1) form a complete 
orthonormal basis. 
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